ViT-SAPS: Detail-Aware Transformer for Mechanical Assembly Semantic Segmentation

نویسندگان

چکیده

Semantic segmentation of mechanical assembly images provides an effective way to monitor the process and improve product quality. Compared with other deep learning models, Transformer has advantages in modeling global context, it been widely applied various computer vision tasks including semantic segmentation. However, pays same granularity attention on all regions image, so some difficulty be images, which parts have large size differences information quantity distribution is uneven. This paper proposes a novel Transformer-based model called Vision Self-Adaptive Patch Size (ViT-SAPS). ViT-SAPS can perceive detail image finer-grained where locates, thus meeting requirements Specifically, self-adaptive patch splitting algorithm proposed split into patches sizes. The more region has, smaller into. Further, handle these unfixed-size patches, position encoding scheme non-uniform bilinear interpolation used after sequence decoding are proposed. Experimental results show that stronger ability than fixed size, achieves impressive locality-globality trade-off. study not only practical method for segmentation, but also much value application Transformers fields. code available at: https://github.com/QDLGARIM/ViT-SAPS.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Importance-Aware Semantic Segmentation for Autonomous Driving System

Semantic Segmentation (SS) partitions an image into several coherent semantically meaningful parts, and classifies each part into one of the predetermined classes. In this paper, we argue that existing SS methods cannot be reliably applied to autonomous driving system as they ignore the different importance levels of distinct classes for safedriving. For example, pedestrians in the scene are mu...

متن کامل

Contour-aware network for semantic segmentation via adaptive depth

Semantic segmentation has been widely investigated for its important role in computer vision. However, some challenges still exist. The first challenge is how to perceive semantic regions with various attributes, which can result in unbalanced distribution of training samples. Another challenge is accurate semantic boundary determination. In this paper, a contour-aware network for semantic segm...

متن کامل

Semantic Constraint and QoS-Aware Large-Scale Web Service Composition

Service-oriented architecture facilitates the running time of interactions by using business integration on the networks. Currently, web services are considered as the best option to provide Internet services. Due to an increasing number of Web users and the complexity of users’ queries, simple and atomic services are not able to meet the needs of users; and to provide complex services, it requ...

متن کامل

A Context-aware Architecture for Mental Model Sharing through Semantic Movement in Intelligent Agents

Recent studies in multi-agent systems are paying increasingly more attention to the paradigm of designing intelligent agents with human inspired concepts. One of the main cognitive concepts driving the core of many recent approaches in multi agent systems is shared mental models. In this paper, we propose an architecture for sharing mental models based on a new concept called semantic movement....

متن کامل

A semantic-aware role-based access control model for pervasive computing environments

Access control in open and dynamic Pervasive Computing Environments (PCEs) is a very complex mechanism and encompasses various new requirements. In fact, in such environments, context information should be used in access control decision process; however, it is not applicable to gather all context information completely and accurately all the time. Thus, a suitable access control model for PCEs...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3270807